Optimal Sensor Scheduling via Non-standard Multi-armed Bandit Formulation

نویسندگان

  • Yunxia Chen
  • Qing Zhao
  • Vikram Krishnamurthy
  • Dejan Djonin
چکیده

This paper addresses optimal sensor scheduling for maximizing network lifetime. We formulate this problem as a nonstandard multi-armed bandit process with non-discounted reward and finite stopping time. We find that the optimal strategy should choose the sensor that has the largest Gittins’ index. Exploiting the underlying structure of sensor scheduling problem, we reduce the computational complex of the Gittins’ index from O(N) to O(N) with respect to the network size N . Moreover, we derive closed-form expression for the Gittins’ index. We also show that choosing the sensor with the most residual energy is an optimal strategy when the channel fading is independently and identically distributed across sensors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Finite dimensional algorithms for the hidden Markov model multi-armed bandit problem

The multi-arm bandit problem is widely used in scheduling of traffic in broadband networks, manufacturing systems and robotics. This paper presents a finite dimensional optimal solution to the multi-arm bandit problem for Hidden Markov Models. The key to solving any multi-arm bandit problem is to compute the Gittins index. In this paper a finite dimensional algorithm is presented which exactly ...

متن کامل

Sensor Scheduling for Multi-parameter Estimation under an Energy Constraint

We consider a sensor scheduling problem for estimating multiple independent Gaussian random variables under an energy constraint. The sensor measurements are described by a linear observation model; the observation noise is assumed to be Gaussian. We formulate this problem as a stochastic sequential allocation problem. Due to the Gaussian assumption and the linear observation model, this proble...

متن کامل

Bayesian Modeling of Human Sequential Decision-Making on the Multi-Armed Bandit Problem

In this paper we investigate human exploration/exploitation behavior in sequential-decision making tasks. Previous studies have suggested that people are suboptimal at scheduling exploration, and heuristic decision strategies are better predictors of human choices than the optimal model. By incorporating more realistic assumptions about subject’s knowledge and limitations into models of belief ...

متن کامل

Bayesian and Approximate Bayesian Modeling of Human Sequential Decision-Making on the Multi-Armed Bandit Problem

In this paper we investigate human exploration/exploitation behavior in sequential-decision making tasks. Previous studies have suggested that people are suboptimal at scheduling exploration, and heuristic decision strategies are better predictors of human choices than the optimal model. By incorporating more realistic assumptions about subject’s knowledge and limitations into models of belief ...

متن کامل

Bayesian and Approximate Bayesian Modeling of Human Sequential Decision-Making on the Multi-Armed Bandit Problem

In this paper we investigate human exploration/exploitation behavior in a sequential-decision making task. Previous studies have suggested that people are suboptimal at scheduling exploration, and heuristic decision strategies are better predictors of human choices than the optimal model. By incorporating more realistic assumptions about subject’s knowledge and limitations into models of belief...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005